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Abstract 

We study properties of a simple random walk on the random digraph D n ^ p when 
np = dlog n, d > 1. 

We prove that whp the stationary probability ir v of a vertex v is asymptotic to 
deg~ (v)/m where deg~(v) is the in-degree of v and m = n(n — l)p is the expected 
number of edges of D n ^ p . If d = d{n) — > oo with n, the stationary distribution is 
asymptotically uniform whp. 

Using this result we prove that, for d > 1, whp the cover time of D n p is asymptotic 
to dlog(d/(d — l))nlogn. If d = d(n) — > oo with n, then the cover time is asymptotic 
to nlogn. 



1 Introduction 

Let D = {V,E) be a strongly connected digraph with \V\ = n, and \E\ = m. For the simple 
random walk W v = (W 1) (t), t — 0, 1, . . .) on D starting at v G V, let C v be the expected time 
taken to visit every vertex of D. The cover time Cd of D is denned as Cd 

— max^gy Cv- 

For connected undirected graphs, the cover time is well understood, and has been extensively 
studied. It is an old result of Aleliunas, Karp, Lipton, Lovasz and Rackoff [2] that Cq < 
2m(n — 1). It was shown by Feige [10], [11], that for any connected graph G, the cover time 
satisfies (1 — o(l))n\ogn < Cq < (1 + o(l))^n 3 , where logn is the natural logarithm. An 
example of a graph achieving the lower bound is the complete graph K n which has cover 
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time determined by the Coupon Collector problem. The lollipop graph consisting of a path 
of length n/3 joined to a clique of size 2n/3 has cover time asymptotic to the upper bound of 
(4/27)n 3 . 

For directed graphs cover time is less well understood, and there are strongly connected 
digraphs with cover time exponential in n. An example of this is the digraph consisting of 
a directed cycle (1,2, ...,n, 1), and edges (j, 1), from vertices j = 2, ...,n — 1. Starting from 
vertex 1, the expected time for a random walk to reach vertex n is Q(2 n ). 

In earlier papers, we investigated the cover time of various classes of (undirected) random 
graphs, and derived precise results for their cover times. The main results can be summarized 
as follows: 

• [4] If p = dlogn/n and d > 1 then whp Ca np ~ d\og (^ty) nlogn. 

• [7, 8] Let d > 1 and let x denote the solution in (0, 1) of x — 1 — e~ dx . Let X g be the 
giant component of G n>p , p = d/n. Then whp C Xg ~ j^z^d) n ^°S n ) 2 ■ 

• [5] If r > 3 is a constant and G n , r denotes a random r-regular graph on vertex set [n] 
with r > 3 then whp Ca nr ~ ^j^nlogn. 

• [6] If m > 2 is constant and G m denotes a preferential attachment graph of average 
degree 2m then whp Cc m ~ ^rj-^logn. 

• [9] If k > 3 and G r ^ is a random geometric graph in of ball size r such that the 
expected degree of a vertex is asymptotic to d log n, then whp Cc r k ~ log (^rr) n log n. 

A few remarks on notation: We use the notation a(n) ~ b(n) to mean that a(n)/b(n) -> 1 as 
n — y oo. Some inequalities in this paper only hold for large n. We assume henceforth that n 
is sufficiently large for all claimed inequalities to hold. All whp statements in this paper are 
relative to the class of random digraphs D n ^ p under discussion, and not the random walk. 

In this paper we turn our attention to the cover time of random directed graphs. Let D n ^ p be 
the random digraph with vertex set V = [n] where each possible directed edge (i, j), i ^ j is 
independently included with probability p. It is known that if np = d log n = log n + 7 where 
7 = (d — 1) logn — > 00 then D n p is strongly connected whp. If 7 as defined tends to —00 
then whp D np is not strongly connected. As we do not have a direct reference to this result, 
we next give a brief proof of this. It is easy to show that if np = logn — 7 where 7 — > 00, there 
are vertices of in-degree zero whp. On the other hand, if np = logn + 7 where 7 — > 00 then 
[12] shows that the random digraph is Hamiltonian and hence strongly connected. Strong 
connectivity for np = logn + 7 where 7 — > 00 also follows directly from the proof of (60). 

We determine the cover time of D n ^ p for values of p at or above the threshold for strong 
connectivity. 
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Theorem 1. Let np = dlogn where d = d(n) is such that 7 = np — logn — > 00. Then whp 
Note that if d = d(n) —> 00 with n, then we have C/j np ~ nlogn. 

The method we use to find the cover time of D n>p requires us to know the stationary dis- 
tribution of the random walk. For an undirected graph G, the stationary distribution is 
7r„ = deg(v)/2m, where deg(v) denotes the degree of vertex v, and m is the number of edges 
in G. For a digraph D, let deg~(v) denote the in-degree of vertex v, deg + (f) denote the out- 
degree, and let m be the number of edges in D. For strongly connected digraphs in which each 
vertex v has in-degree equal to out-degree (deg~(t>) = deg + (t>)), then n v = deg~(v)/m. For 
general digraphs, however, there is no simple formula for the stationary distribution. Indeed, 
there may not be a unique stationary measure. The main technical task of this paper is to 
find good estimates for 7r v in the case of D n}P . Along the way, this implies uniqueness of the 
stationary measure whp. 

We summarize our result concerning the stationary distribution in Theorem 2 below. For a 
given vertex v, define a quantity <j*(f), which in essence depends on the in- neighbour w of v 
with minimum out-degree: 

;>)= max IpW). (i) 



•j&n-(v) I deg (to) 

Theorem 2. Let np = dlogn where d = d(n) is such that 7 = np — logn — > 00. Let 
m = n(n — l)p. Then whp, for all v G V, 

deg~{v) + s*(v) 



in 



We note the following special cases. 

Remark 1. We prove in Lemma 14 that whp s*(v) = o(deg~(v)) for almost all vertices v. 
For these vertices, the s*{v) term can be absorbed into the error term 0/717. 

Remark 2. If '7 = w(loglogn) then whp <?*(t>) = o(deg~(v)) for all vertices v. In particular 
when d = 1 + 5, 5 > and constant then the minimum out-degree is fi(logn), in which case, 
ir v ~ deg~(v)/m. 

Remark 3. If d = d(n) — > 00 with n, whp the stationary distribution of D n p is ir v ~ 1/n. 



2 Outline of the paper 

At the heart of our approach to the cover time is the following claim: Suppose that T is a 
"mixing time" for a simple random walk and A v {t) is the event that W u does not visit v in 
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steps T, T + 1, . . . , t. Then, essentially, 

Pr(A,(*)) ~ e~ t7rv/Rv . (2) 

Here Ry > 1, v G V is the expected number of visits/returns to v by the walk W v within T 
time steps. This is the content of Lemma 3 and we have used it to prove our previous results 
on this topic. Given (2) we can estimate the cover time from above via 

V s>t 

This is (93) and we have used this inequality previously. Here C u is the expected time for 
W u to visit every vertex. It is valid for arbitrary t and we get our upper bound for Co by 
choosing t large enough so that the double sum is o(t). 

We estimate the cover time from below by using the Chebyshev inequality. We choose a set 
of vertices V** that are candidates for taking a long time to visit and estimate the expected 
size of the set of vertices in V** that have not been visited within our estimate of the cover 
time. We show that E| V' \ — > oo. We then take pairs of vertices v, w G V** and contract them 
to a single vertex 7 and then use (2) to show that Pr(A 7 (t)) ~ Pr(A v (t))Pr(A w (t)) . 

The main problem here is that we do not know 7r„ and much of the paper is devoted to proving 
that, essentially, whp, 

Kv „ deg ^ for all v e V. (3) 

m 

Our proof of this leads easily to a claim that whp T = 0(\og 2 n) and we will find then find 
that it is easy to prove that R v = 1 + o(l) for all v G V. 

We approximate the stationary distribution it using the expression 7r = TzP k , where P is the 
transition matrix. For suitable choices of k we find we can bound 

Pi k \y) = Pr(W x (k) = y) 
from above and below by values independent of x and obtain, essentially, 

m 

an expression independent of x. (3) follows easily from this. 

To estimate Px k \y) from below we proceed as follows: We let k = 21 = | log np n. We consider 
two Breadth First Search trees of depth I. T l ° w branches out from x to depth I and T l ° w 
branches into y from depth i. Almost all of the walk measure associated with walks of length 
2£ + 1 from x to y will go from x level by level to the boundary of T^, ow , jump across to the 
boundary of T l ° w and then go level by level to y. We analyse such walks and produce a lower 
bound. 
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To estimate Px (y) from above we change the depths of the out-tree from x and the in-tree to 
y. This eliminates some complexities. In computing the lower bound, we ignored some paths 
that take more circuitous routes from x to y and we have to show that these do not add much 
in walk measure. 

The structure of the paper is now as follows: Section 3 describes Lemma 3 that we have often 
used before in the analysis of the cover time. Section 4 establishes many structural properties 
of D Hy p. In Section 5 we prove the lower and upper bounds given in Theorem 2. These bounds 
hold for any digraph with the high probability structures elicited in Section 4. Sections 4 
and 5, which form the main body of this paper, are first proved under the assumption that 
2 < d < n s , for some small 5 > 0, an assumption we refer to as Assumption 1. In Section 
6, we extend the proof of Theorem 2 by removing Assumption 1. Section 7 is short and 
establishes that the conditions of Lemma 3 hold. To do this, we use a bound on the mixing 
time, based on results obtained in Sections 5, 6. Finally, in Section 8 we establish the whp 
cover time, as given in Theorem 1. 



3 Main Lemma 



In this section D denotes a fixed strongly connected digraph with n vertices. A random walk 
W u is started from a vertex u. Let W u (t) be the vertex reached at step t, let P be the matrix 
of transition probabilities of the walk and let Pu\v) = Pr(W M (t) = v). We assume that the 
random walk W u on D is ergodic with stationary distribution ir. 

Let 

d(t) = max \Pjf\x) — tt x \, 

u,x&V 

and let T be a positive integer such that for t > T 

max \P®(x) -it x \< n~ 3 . (4) 

u,x£V 

Consider the walk W„, starting at vertex v. Let r t = r t (v) =Pr(VV v (t) = v) be the probability 
that this walk returns to v at step t — 0, 1, ... . Let 

T-l 
j=0 

and let 

R v = -Rt(I)- 

Lemma 3. Fix a vertex u G V and for v 6 V and t >T let A v (t) be the event that W u does 
not visit v in steps T, T + 1, . . . , t. Suppose that 



5 



(a) For some constant 9 > 0, we have 



min \R T (z)\ > 9. 

\z\<\+\ 



(b) T 2 n v = o(l) and Tir v = n(n~ 2 ) for all v e V. 



Let K > be a sufficiently large absolute constant and let 



KT 

Then, with 



* = ( 6 ) 



Pv R v (l + 0(Tn v )y (7) 



we have that for all v G V and t >T, 



1 + 0(Ttt v )) t ^ (rr2 xt/2 ^ 



Pr(A v (t)) = v ^ v °" + 0(T\ v e 

i Pv) 



4 Structural Properties of D n ^ p 

In this section we gather together some whp properties of D n p needed for the proof of Theorem 
2. Some are quite elaborate and so we will try to motivate them where we can. 

We stress that throughout this section, the probability space is the space of D n p and not the 
space of walks on an instance. 

Once we complete this section however, we can concentrate on estimating the cover time of a 
digraph with the given properties, as long as Assumption 1 of (20) holds. For large p outside 
Assumption 1, the proof is quite simple. 



4.1 Degree Sequence etc 

Chernoff Bounds The following inequalities are used extensively throughout this paper. 
Let Z = Z\ + Z2 + • • • Z]\f be the sum of the independent random variables < Zi < 1, % = 
1,2,..., N with E(Zi + Z 2 + ■ ■ ■ + Z N ) = Nfi. Then 

Pr(\Z - Nfi\>eNfi) < 2e~ £2 ^ /3 . (9) 
Pr(Z>aNfi) < {e/a) aN>M . (10) 

For proofs see for example Alon and Spencer [3]. 
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The next lemma gives some properties of the degree sequence of D n p . The lemma can be 
proved by the use of the first and second moment methods (see [4] for very similar calculations). 

Let np = d log n and let 

Ao = C$np where Co = 30. (11) 

Lemma 4. 

(i) First assume that np = dlogn where 1 < d = 0(1) and (d — 1) logn — > oo. Let 

D{k)=n("~ iy ) 1 f(l-p) n - 1 - k 
denote the expected number of vertices v with deg~(v) = k < A . Note that 

Let D(k) denote the actual number of vertices of in-degree k, and let 

K = {fce[l,A ]: D(k)<(lognT 2 }. 

Ki = {1 < k < 15 : (logn)^ 2 < D(k) < log logn}. 

K 2 = {k E [16, A ] : (logn)- 2 < D(k) < (logn) 2 }. 

K 3 = [l,Ao]\(^oUJ^UK 2 ). 

The degree sequence has the following properties. 

(a) Ifd-1> (logn)- 1 / 3 then 

Ki = 0, min{£; e K 2 } > (logn) 1/2 , \K 2 \ = O (log logn). 

(b) The following holds whp; For all degrees k £ [1, Ao]. 

k e K , D{k) = 0, 

k e K u D(k) < (log logn) 2 , (13) 
ke K 2 , D(k)<(\ogn)\ (14) 

k G K 3 , < D{k) < 2D(k). (15) 

(ii) Suppose that 1 < d < n 5 where 5 is a small positive constant. Let k* = \(d — 1) logn] . 
Let 

V* = \y G V : deg~(v) = k* and deg + (v) — k* — [dlogn]} 
and let 7^ = (d — 1) log (^zf) ■ Then whp 



\V*\ > 



n 



id 



lOdlogn 
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(Hi) LetT> be the event 

{3v G V : deg + (v) > A or deg~(v) > A } , (16) 

then 

Pt(V) < n- 10 e~ 10np . (17) 

(iv) The number of edges \E[D n>p \ ~ m = n(n — l)p whp. 

(v) deg ± {y) ~ np for all v G V whp if d — > oo. 

Proof We will only give an outline proof of (ii) as the other claims have (essentially) 
been proved in [4]. We have 

E (in) = n( n ~ x ) (\~ i y^ k \i- P f-^- k ' 

Using, (™) > ^rj2 (if)*' wn i cn i s obtainable from Stirling's approximation, we see that 

1 / rl \ ( d - 1 ) lo s n 

E(ri)>(l-o(l)) 9(rf _ 1)1/2J/21ogii (— ) . (18) 

The Chebyshev inequality will show that |V*| is concentrated around its mean, provided we 
verify that the mean tends to oo. 



If d > 2 then 



, \ (d-l)logn / j 



> exp < (d — 1) logn I - — - — — — — — — > > n 



,V2 



Now d < n 5 and so E(|V*|) — > oo follows from (18). Ifd=l + e<2 and e is bounded 
away from zero then so is (^ry) 1 and so E(|V*|) = n^ 1 ^. So now suppose that e = 

where a; = tu(n) — > oo and to = o(logn). Then, (jzi) d 1 > (^frO^- If a; > log 1 / 2 n then 
(^i)^ 1 > e logV2n and if u < log 1/2 n then (^Y" 1 > log w/2 n. In either case 

E(|V*|) > \og e n where 9 = 6{n) -> oo (19) 

□ 

4.1.1 Assumption 1: a convenient restriction 

We will first carry out the main body of the proof under the following assumption: 

Assumption 1 : 2 < d < n 5 . (20) 
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Here < 5 <C 1 is some small fixed positive constant. We note that our choice of the 
value d > 2 is somewhat arbitrary, and any constant larger than 1 would suffice. We wait 
until Section 6 to remove Assumption 1. The proof for d > n s is much simpler and is given 
separately in Section 6.1. The proof for 1 < d < 2 is given in Section 6.2. 

Under Assumption 1 and d = 0(1) there is a constant c > and an interval 

I=[c np,A ], (21) 

such that if v G [3n/4, n] then there exists 7 = 7(c) > such that 

Pr(Bin(u,p) e I) = 1 - o^" 1 " 7 ). (22) 

When d-^oowe can take c = 0.999 and Co = 1.001. 

Let Sg (resp. 8$) be the event that the in-degree (resp. out-degree) of all vertices in S C V 
are in the interval /. Thus e.g. 

8+ = {D n . p : Vv E S C V, deg + (v) e [c np, A ] } . (23) 

Let 8s = 8g n 8$ . Then for any S CV we have 

Pr(£ 5 ) = 1 -0{n^). (24) 



4.2 Properties needed for a lower bound on the stationary distri- 
bution 

The calculations in this section are made under Assumption 1. 

Fix vertices x, y where x = y is allowed. Most short random walks from vertex x to vertex 
y take the form of a simple directed path, or cycle if x = y. We can count such paths (or 
cycles) with the help of a breadth first out-tree T l ° w rooted at x, and a breadth first in-tree 
rplow roo t ec [ qx y_ \y e build these trees to depth £, where 



(25) 



For a vertex v let iV (v) be the set of in-neighbours of v and for a set S, let N (S) 
[j vGS N~(v). Define N + (v ), N + (S) similarly with respect to out-neighbours. 



Construction of in-tree T l ° w '. 



For fixed y G V, we build a tree T y = T l y ow rooted at y, in breadth-first fashion. Define 



Y = {y}. The tree Ty has level sets Y i} i = 0,1,...,^, and vertex set Y = Li i=0 Yi. Let 

Y<;, = U-;<,;Y,-. 
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Let T y (i) be the tree consisting of the first i levels of the breadth first tree T y (£) = T l ° w . 
Given T y (i) we construct T y (i + 1) by adding the in-neighbours of Y$ in V \ Y<j. To remove 
ambiguity, the vertices of Yi are processed in increasing order of vertex label. Let this order 
be (vi, t>2, v\Yi\)- For v G Yi, the set N^(v) is the subset of Y i+ i R N~(v) whose edges in the 
tree T y {i + 1) point to v. Formally these sets are defined as follows: (i>i) = N~(v\) \ Y<i, 
and in general N^(v k ) = N~(v k ) \ (Y<; U N~(vi, u fc _i)). 

Thus if v G and w G Y"i + i and (u>,i>) is an edge of T y {i + 1), then v = v k is the first 
out-neighbour of w in Y"j in the order (vi, ...,Vk). As w G Yi+i, there are no edges from w to 
y<i-i, and thus no edges between w; and F<j_i U Vfc-i}. 

Let degy(w) denote the in-degree of v G Y in T^ ™. If deg^(-u) > for all v G F<^-i, we say 
t/ie construction of T l ° w succeeds. 

Associated with this construction of T l y ow is a set of parameters and random variables. 

• For Vj G Yi, let a(vj) = \V \ [Y<i U (vi, Vj-i)]\. Thus cr(vj) is the number of 
vertices not in T y after all in-neighbours of v\, ...,Vj-\ have been added to T y . 

• Let B{vj) ~ Bin(a(vj),p). Thus -B(wj) = \N^(vj)\ = deg^(u,), the in-degree of Vj in 

T 
± y . 

• Let (j'(vj) = \ |>^i_i U 

• Let D ~ 1 +Bin(a'(vj),p), and let -D(j, fc), fc = 1, B{vj) be independent copies of D. 

The interpretation of the random variable D(j, k) is as follows. If w k G N^(vj) then -D(j, A;) 
is the out-degree of Wk in D n>p . The one arises from {wk, Vj) being the first edge from Wk to 
Yi. 

Construction of out-tree T l ° w . Given the set of vertices Y of T y ow , we define X = 
{x} ,X u ...,X e where X i+1 = N+(Xi) \ (Y U X U ■ ■ • U X t ) for < % < £. If w G 
is the out- neighbour of more than one vertex of Xj, we only keep the edge (z,w) with z as 
small as possible as in the construction of Ty™. Let X = \J i=Q Xi and let T 1 ™ denote the 
BFS tree constructed in this manner. Let deg^(w) = \N^{v)\ denote the out-degree of v G X 
in T^. Similarly to the construction of T l y ow , the value of degj(w) is given by a random 
variable B{v) ~ Bin(a(v),p). If degj^(u) > for all v G A<^_i, we say the construction of 
Tjf" succeeds. 

We gather together a few facts about T!f w , T l y ow that we need for the proofs of this section. 
Lemma 5. 

(i) With probability 1 — 0(n~ 7 ), the construction of T l ° w ,T l ° w succeeds for all x,y G V. 
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(ii) With probability 1 — 0{n~ 1 ), 

For all x, and for all v G X<^x, deg^(v) G [c np(l — o(l)), C^np], 
For all y and for all v G Y<g_i, deg^{v) G [conp(l — o(l)), C$np\. 



(iii) Given £ x ,£ y , then for i < £, \Xi\ ~ deg (x)(np) 1 , \Yi\ ~ (ieg (y)(np) 1 qs 
Proof We give proofs for T x ° w , the proofs for Ty are similar. 

Part (i), (ii). Let X = {xq = x,x±, . . . ,xn} where x« is the i-th vertex added to T l ° w . For 
Xj G X, let f(xj) = \N + (xj) n (Y U {x ,xi, . . . Thus deg + (w) = deg^(w) + /(v). 

We can bound f(v) by the binomial Bin(Nx,p) where Nx = |Y| + |X|. This is true even after 
constructing T x ow ,Ty OW , because the out-edges of v counted by /(f) have not been exposed. 
Assuming -CD, see (17), we have 

i 

N x <2j2^o = n 2/3+o{1) . 

i=l 

Using the Chernoff bound (10), we have with u = log 1//2 n that 

Pr > ^) < Pr (Bin^n 2 '^ ^^) > ^) = 0(n~ w ). (26) 

The event [j xev £x <e _ C Sy and the latter holds with probability 1 — 0{n~~ f ). Thus given (26), 
and £y we have degj(f ) > for v G X<^_i for all x G V. In summary, whp the construction 
of T l x ow succeeds for all x G V, and deg^(-u) G [conp(l — o(l)),Conp] for all t> G X<e-i in all 
trees T^, x EV. 

Part (iii). By construction T l ° w was made after T^, so \Xi\ depends on T l ° w and T l y ow . 
Assume £ x = |deg + (x) E I = [c np, C np] } , and that |Y| < ri 2 / 3 +°( 1 ). (Strictly speaking we 
should verify that |Y| < n 2 / 3 +°( 1 ) before considering Tj ou; . On the other hand, the proof we 
present now can be applied to Ty OW ). For % > 1, is distributed as Bin{n — o(n), 1 — 

(1 — p)' Xi '). The number of trials n — o(n) is based on the inductive assumption that = 
(1 + o(l))n\Xj\p and that |AT<j|p = o(l). That these assumptions hold qs. follows from the 
Chernoff bounds. We thus have that 

\X e \ ~deg + (x)(n P y-\ qs. (27) 

□ 

For u G Xj let P u denote the path of length i from x to u in T l ° w and 

W A deg (w) 



X A sequence of events A n ,n > hold 'quite surely'(qs) if Pr(A n ) = 1 — 0(n K ) for any constant K > 0. 
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In the event that the construction of T l ° w fails to complete to depth £, let ^2 uGXe a^ u = 0. 
Similarly, for vE^ let Q v denote the path from v to y in Ty OW and 

Pi,v = II T^r~Y < 28 ) 

In the event that the construction of Ty OW fails to complete to depth £, let J2 v€Ye Pe,v — 0. 
Let 

Z{x, y) = Z l ™(x, y) = V ^— (29) 

where l w is the indicator for the existence of the edge (u, v) and we take de g+"( M ) = if 
deg + (u) = 0. Note that Z(x, y) = if we fail to construct T l ° w or ij" . 

Remark 4. TTie importance of the quantity Z(x, y) lies in the fact that it is a lower bound 
on the probability that W x (2£ + 1) = y. 

The aim of the next few lemmas is to prove (see Lemma 9), under Assumption 1, that with 
m = n{n — l)p, 



Pr (s XtyeV such that Z(x,y) ^ + ^ = 0(n 



1 ) 



(30) 



The first two lemmas give whp bounds for Yl ueX ct^ u , ^2 veY Pi,v respectively, as used in the 
third lemma and its corollary. 

Lemma 6. Let 

(31) 



Ai(x, y) = < 1 - e x < ^2 a£ > u - 1 f • 



For e x = 0(l/Vlog n), 

Pr (-^Axix, y) n £+) = o(n~ 10 ). (32) 

Proof 

For u G Xi let xP u = (uq = x, u\, ■ ■ ■ , Ut = u) denote the path from x to u in T%"° '. For 
the random walk on the digraph T^ ow , starting at x; Xi is reached with probability $ = 1 in 
exactly i steps, after which the walk halts. Thus 

i = *=£ U^La^Y, <*>,«■ < 33 > 

uex e veP u aegT[V) uex e 
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We assume that the construction of T l ° w succeeds, and that degj(-u) > for v G X<£_i, as 
established in Lemma 5. In the notation of that lemma, deg + (-u) = deg^(f) + f(v). Now 

* = En 



, deg (v) — f(v) 



Now if 



h — f7 ^ +( ^ 1 + h VnG ^' ( 34 ) 

^ 1 ~ f{ v )/deg + {v) 



then J^uex ct£ )U = 1 — o(l) provided h = o(l). We next prove we can choose h = 0(l/i/logn) 
which determines our value of ex- 



Similar to the proof of (26) of Lemma 5 we have, with u = y/\ogn that 



Pr j f( v ) >—) < Pr (Bin{ 

\v£xPu / 



n 2 / 3 +°W,p)>^)=0(n- 10 ). (35) 



Using (26), and (35) it follows that 



j2 M < 



v€Pu ^ ~ W ^ ~ 1 

For < x < 1, (1 — x) _1 < e 3 ^ 1-3 ^, and so 



deg + (w) - /(w) 



^G^r) = 1+ °©' (36 > 

There are at most n trees and n paths per tree and so (34), with ex = h = 0(l/u) = 
0(l/\/log n), follows from (36). This completes the proof of (32). □ 

The next step is to obtain an estimate of YlveY Pi,v- The proof is inductive, moving down the 
tree T y level by level. For brevity we write d + {u) = deg + (w), d^(u) = deg^(-u) etc. 

Let the random variable W(y,i) be defined by 
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d+(z)' 

ueJV- (3/) veYi z&Pu y ' 

where for v G Yj the notation means that the the unique path vPuy from v to y in T y passes 
through u, and that wPu is written as v = Zi, Zj, z\ = u in the product term. 

Note that 

Define W*(y,i) by 

wm= e z^) n ^ 

where for v G Y« we define d^(v) = 1 so that W(y,£) = W*(y,£). Note that 

d^,(u) 



W*(y,l)= Yl 



d+(u) 



We prove the following lemma for a more general value of £, as it is also used in our proof of 
the upper bound. 



Lemma 7. Let 



A 2 (y)=lD n>p :J2^ = ^ + 0(l/V^))^ E ( 37 ) 

Let t = 77 log n where < 77 < 2/3. Then under Assumption 1, 

Pr(3y G V such that ^A 2 (y)) = 0(n" 7 ). 



Proof 

The lemma is proved inductively assuming £~ and £y\{ y }- We prove the induction for 2 < 
% < £, where by assumption (np) e = 0(n°' 67 ). 

Let E ,+ W(y, i) be the expectation of W(y, i) over (d + (v), v G Yj), conditional on all other 

degrees d + {u) > 0,d^(u),u G Y<j_i being fixed such that |Y<j_i| ~ <i~(?/)(np) l ~ 1 < n ' 67 
which is true qs from Lemma 5. 

For v G Yi, d + {v) is distributed as D{v) ~ 1 + Bin(a'(v),p), for some cr'(?j) G Jo = [n — 
0(n°' 67 ),n]. Given the values o~'(v) for v G Yj, the -D(w) are independent random variables. 
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For v G Yj, let vPu be written vwPu, where (v,w) G T r Then 

E [d + W] i n dMz)) =E (^)) ( n d+u) 

where given £y\/ y }> an d = max(n -0 ' 33 , n~ 7 ) 



[j^rs) = (i + o(5))-. 

D(v) J np 



This follows from the identity 

N 



3=0 

obtained by integrating (q + px) N ; and from Pr(-i£+) = 0(n~ 1 ~' y ). Thus 



Vw."^ = d + E e (n 5^,) 

= p+o< e <?<«> (n ^ 

iueYi_i \zewPu y 

= (l + (5))—W*(y,i-l). 
np 

To obtain a concentration result, let = • ((1 — o(l))c np) 1 , we can write {/(«) = 

X^eK wnere £4 are independent random variables. Assuming £y\( y } an d that Lemma 
5(h) holds we have (c(l - o(l))/C )'' < CA, < 1. 



Let 6i = ^/3Klogn/(EU) for some large constant K. Then 

Vr(\U(i) - EU\ > eEU) < 2e"^ El/ = 0( 
and so 

Pr(\W(y,i) - EW\ > eEW) = 0(n~ K ). 

Note that EU > \Yi\(c(l - o(l))/C ) i > (c/2)(cnp/2C )\ Thus e* < l/y/(A log n)^ 1 for some 
A > constant. For z > 2, e,j = 0(l/yTogn), and thus = o(l). 

In summary, with probability 1 — 0(n~ K ), 

W(y, i) = (1 + 0(5) + Ofo)) — i - 1). 
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Continuing in this vein, let E »- W*(y,i — 1) be the expectation of W*(y,i — 1) over 

\CL T (i— 1)1 

(d^,(v),v G Yi_i), conditional on all other degrees (d + (u),d^,(u),u G Y<i_ 2 ) being fixed. For 
v G Yi-i, d^{v) is distributed as B{v) ~ Bin(cr(v),p) conditional on £y\Y t - Let denote the 



indicator for an event X. then 



EB(v) = E(B(v) ■ 1 £ -J + E(B(v) ■ 1^ ] 



and, splitting the second event on T> gives 



E(B(v) ■ 1_ ) = O(A n-i) + 0(nn~ w ). 

y \i£ 

Thus, given £y\Y t we nave Ed^(v) = (1 + 0(S))np. 
Thus 

(<*«> n ^i,) = ( n ^ 

and 

E [^ (l -i)] H/ *^' i - 1) = (1 + 0(<5))np i - 1). 
Using Lemma 5 (ii) and arguments similar to above, for i > 3 with probability 1 — 0(rT 

W*(y, i - 1) = (1 + 0(5) + 0(e i _ 1 ))np i - 1) 

completing the induction for i > 3. 

The final step is to use 

W(y, 2) = (1 + 0(5) + 0(e 2 ))— Wfo, 1), 

and thus whp 

1 1 
W(y,*) = J(l + 0(S) + 0(e l )) 2 — W*(y,l) 

AA Tip 



K \ 



Thus from (22) 

Pr(3y G V such that = 0(Pr(3v G V : deg ± (w) £ J)) = 0(n" 7 ). 

□ 
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Corollary 8. Provided Assumption 1 holds, let 

My) = = (i + o(i/v / i^))^-^l , (38) 

LIS n p J 

Pr (3y G 1/ stzc/j £/iat -v^)) = 0(n~ r ). (39) 

Proof Referring to (37), under Assumption 1 and -CD, then d^iu) = deg~(w)(l — o(l)) 
simultaneously for all u G N~(y) with probability 1 — 0{n~ 1 ~ 1 ). Let ( = 1/ log log log n. A 
vertex is normal if at most £o = |~4/(£ 3 cf)] of its in- neighbours have out-degrees which are not 
in the range [(1 — Qnp, (1 + Qnp], and similarly for in-degrees. Let Af(y) be the event y is 
normal. We observe that 

Pr(^A%) | £-) < 2 E I 5 ) (2e^ ?2 " p/3 ) & = 0(n^( logloglosn )), 

where £~ is given by (23), and thus (see (22)) 

PrHA%)n£-)) = O^- 1 ^). (40) 

Now if y is normal then 

deg"(y)^-0(Co)< £ ^^<deg-(y)i±| + 0(Co). 

□ 

Recall the definition of Z(x,y), 

Z(x,y) = V a liU P liV ™ , (41) 
S de S («) 

where l u „ is the indicator for the existence of the edge (u, v) and we take de g+| M ) = if 
deg + (u) = 0. The next lemma gives a high probability bound for Z{x, y). 

Lemma 9. Let 

A 3 (x,y) = {z(x,y) = (l + 0(e z )) d 



m 

where ez = l/(\/\ogn). Then given Assumption 1, 



Pr(3x,y:^A 3 {x,y)) = 0(n^). (42) 
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Proof 

Let 

B = B(x, y) = {E+^ Xe n £- XYg R A x {x, y) n .A 2 (y) n £), 

where £ is given by (23), A\,A2 by (31), (38), and £ is the event that Lemma 5 holds. 

Let u a Xi and let w 6 Y \ Y%. As X< D 7 = I, we know that u is not an in-neighbour of 
w. Other out-edges of u are unconditioned by the construction of T l ° w \T l ° w . Given Y\Y$ < 
n 2/3+o(i)^ ^Yi e distribution of deg + (w) is Bin(i/,p) for some n — n osr < v < n — 1. Thus 



E 



fe| B ) - t 0^1 - P)-^ = i (1 + 0(»-)) • (43) 



Here k/u is the conditional probability that edge (w, f) is present, given that w has out- 
neighbours. 

We use the notation Prc(-) = Pr(- | C) etc, for any event C. From (31), (38), (43), 

E B {Z) = (l + 0(e z ))^-^-. (44) 

m 

Conditional on £>, |Yg| < n 2//3+o( - 1 ' ) by construction, and as the edges from u to Yi are unexposed, 
Pr B (\N + (u) n y«| > 1000) < Pr(5m(n 2/3+o(1) ,p) > 1000) < n~ 10 . (45) 

Let 

jr = jr(x, y) = { |iV+(w) n < 1000, Vw G X e } , 

and let (?(x, y) = B(x, y) R .F(x, y) PI £i ■ The quantity of interest to us is the value of Z(x, y) 
conditional on Q(x,y). We first obtain Eig(Z) from Eb(Z) using 

E B (Z) = E B (Z • l Hx , y)n£+x ) + E S (Z ■ l.[^ >y)n£ + j). (46) 
The event -<[J-'(x,y) R £%] C [.F(x, y) R ""^xj ^ [ _, ^ r ( x ' 2/)]- Using (41), we obtain 
E B (Z • l AHx>y)nS +^ 

= 0(E B (Z)n^) + O ((^L) 1^1 (0(n-^) + 0(^ 10 )) + 0(|X,||F,|)0(n" 10 ) (47) 



E S (Z) 0(n 



To see this, partition the vertices of Xi into sets R, S, where vertices in R have out-degree in 
[conp, Conp], and vertices of S do not. The first term in (47) is the contribution to the first term 
in the RHS of (46) from the vertices in R, multiplied by the probability of —*£x t - Assuming 
T{x, y) holds, the second term in the RHS of (47) is the contribution to the first term in the 
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RHS of (46) from the vertices in S. The last term in the RHS of (47) is the contribution is 
the contribution to the first term in the RHS of (46) in the case where —iF{x, y) holds. 



Thus 



E(Z ■ I* ■ l Hx , y) n £ + ) E B (Z.l^ )n£ + )Pr(B) 
E g (Z) = 



Pr(0) Pr(0) 

and so 

E g (Z) = E B (Z)(1 + 0(n^)) = (1 + 0(e z ))^-^-. (48) 

np n 

We now examine the concentration of (Z \ Q). Let A = 1000/((1 - o(l))c np) 2£+1 . It follows 
from Lemma 5(ii) that given Q we have Z u < A. Let Z u = Z u /A, then for u e Xe, the Z u are 
independent random variables, and < Z u < 1. Let Z = J2 ueXe Z u and let ju = E g (Z). Thus 

^ = n V3+o(i). (49) 

It follows from (9) that if < 9 < 1, 

Pr g (|Z-/2| > 0£) < 2e"^ /3 . 
With 6 = 4(np/Ji) 1/2 we find that, 

Pr g {\Z - fi\ > 4(np/2) 1/2 ) = o(n" 4 ), 

and hence that 

Pr g (\Z - E g Z\ > 4A(npj2) 1/2 ) = o(n" 4 ). 
Using (49) we have 4A(np /i) 1/2 = 0(?i~ 7/6+o(1) ), and so 

Pr g (\Z- E g (Z)\ = O (-^r-J) )=1- o(n" 4 ). 



n 7/6+o(l 

We see from (48) that E g {Z) = (1 + 0(e z ))^ZM. Thus 



Pr g ( Z(x,y) ? (1 + 0( Car ))^M ) = o(n -4). (50) 



Using (24), (32), (39) and (45), 
Pv(\J^g(x,y) 



< Pr(^V) + Pr(-£) + Pr I \J -J?(x, y))+Pr((J -A (s, y) J + Pr f |J ^A 2 (v) 

= 0(n^). (51) 
Thus finally, from (50) and (51) 

Pr(3x, y : ^A 3 {x, y)) = O(n^). (52) 



□ 
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4.3 Properties needed for an upper bound on the stationary dis- 
tribution 

We remind the reader that np < n s . We first show that small sets of vertices are sparse whp. 

Lemma 10. Let ( be an arbitrary positive constant. For all S C V, \S\ < sq — (1 — 2£)A, S 
contains at most \S\ edges whp. 

Proof The expected number of sets S with more than \S\ edges can be bounded by 

< S ^^{e 2 np) s sep 

s=3 

< exp(-C(l + o(l))logn) = o(rT c/2 ). 

□ 
Let 

We will use the following values: 

£ = (1 + rj)A, h = {l- 10?7)A, £ 2 = II77A. 

For the upper bound we need to slightly alter our definition of breadth-first trees and call 
them T^ p ,Ty P . This time we grow T" p to a depth £\ and Ty P to a relatively small depth £2. 
With this choice, Lemma 10 implies that Y will contain no more than \Y\ edges whp. This 
reduces the complexity of the argument. We fix x, y and grow T" p from x to a depth £1, and 
Ty P into y to a depth £ 2 - The definition of T" p is slightly different from T low , but we retain 
some of the notation. 

Construction of T" p . We build a tree T£ p , much as in Section 4.2, by growing a breadth-first 
out-tree from x to depth £. The difference is that we construct T" p before Ty P , so that T" p 
is not disjoint from Y. As before, let X = {x}, and Xi, i > 1 be the i-th level set of the 
tree. Let T^ p {i) denote the BFS tree up to and including level i, and let T^ p = T™ p (£i). Let 
X<i = Uj<iXj, and let X = X<^. In Section 5.2 below we will need to consider a larger set 
X~< h where £ 3 = (1 - r//10)A. " 

Construction of T^ p . Our upper bound construction of Ty P is the same as for the lower 
bound, except that we only grow it to depth £2. 

Our aim is to prove an upper bound similar to the lower bound proved in Lemma 9. For 
u G Xfr we let 

a £ltU = Pr(W^0 = u) 
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where 

X)a*,«<l. (53) 

The RHS of (53) is one, except when we fail to construct Ty P to level £2- 

This is the only place where we write a structural property of D n p in terms of a walk proba- 
bility. This is of course valid, since oc£ 1)U is the sum over walks of length £\ from x to u of the 
product of reciprocals of out-degrees. Fortunately, all we need is (53). 

We also define the (3i )V as we did in (28) and now we let 

Z(x,y) = 2r*(x,y)= V c^J^-^— . (54) 

v£Y l2 \X 

The following lemma follows from Corollary 8. 

Lemma 11. Let i be as in (25). If2<k<£ then for some ey = o(l) we have 

deg~(y) 



where 7 is as in (22). 



oin- 1 ^ 2 ) 



np 

v&Y k 



It follows by an argument similar to that for Lemma 9 that 
Lemma 12. 

Pr fix, y : Z(x, y)>(l + o(l)) — ^ ) = 0(n"^ 2 ). (55) 



In computing the expectation of Z, some of the vertices in X of T" p may be inspected in our 
construction of Ty P , or of T£> up to level £ x . Thus E(l ut ,/deg + (w)) < (l/n)(l + o(l)), (see 
(43)). 

Remark 5. T/ie wpper bound for Z(x,y) obtained above is parameterized by £0 = (1 + ? ?)A. 
Provided 77 > constant, so that Lemma 12 holds, we can apply this argument simultaneously 
for n 7 / 3 different values of 7]. 

We next prove a lemma about non-tree edges inside X, and edges from X to F \ Yg 2 . 
Lemma 13. 

(a) Let 4 = (1 -77/10)A and 

£0(^3) = {Vz6 ^<£ 3 : z has < 100 / 77 in-neighbours in X<£ 3 } . 
Then Pr(-.£ a (4)) = 0(n" 9 ). 
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(b) Let X° = {v G X £ : JV+(*;) n X< t ^ 0} and 

£ 6 (^) = {\X° e \ < 18A 2£ p + log 2 n}. 
T/ien Pr(^C b (£)) = 0(n~ 10 ) for £ < £ 3 . 

(c) Let 



tc, 



KA 



log np 

Fix t < to and i > 2r]A and let 



where K = 2 log(100C /"c ). 



(56) 



S° t = {z G X : z is reachable from X° in at most t steps} . 
Then let A° = A°(x, y, t) be the number of edges from S° t H X to Yi^-t- Then, 

Pr(3x,w,t : A° > logn) = 0(n~ w ). 

(d) Let A = A(x,y,t) be the number of edges between X^ andY^ t \X , where t < t < £ 2 — 1. 



Pr(3x, y,t: A > 9\X ei \\Y e ^ t \p + log 2 n) 



oin 



-10\ 



Proof 

(a) Let z G X(l 3 ). Let ( be the number of in- neighbours of z in X<£ 3 . In the construction of 
T^ p (£ 3 ), we only exposed one in-neighbour of z. Thus ( is distributed as 1 + Bin(\X<£ 3 \,p) < 
1 + Bin(A e s ,p) + nPr(X>). We apply (10) and (17) to deal with the binomial. Hence if 
r + 1 = IOO/77, 

Pr(C > r + 1) < AJjV + n- 10 e- l0np < 2n r{5 ~ ri/w) + n^V 10 ^ = 0(n" 9 ). 
Part (a) of the lemma follows. 

(b) For v G Xg the out edges of v are unconditioned during the construction of T^ p (£). The 
number of out edges of v to X<? is Bin{\X<n\,p). Unless V occurs, \X<%\ < 2A e and 

Pr(|iV + (t;) nX<^| > I -iD) < 1 - (1 -p) 2A ° C < 2A</p, 
E(\X°\ I -.P) < 2A 2 V 



and 
By (10) 



Pr(|X|| > 18A 2 ^ + log 2 n) = 0(n" 10 ) + Pr(£>) = 0(rT 10 ). 

(c) Let S(u,t') be the set of vertices in X that a walk starting from u G X, can reach in 
h-i + t-t' steps. Thus unless £> occurs, |S(M')I < A ^" i+ '~''. So, given -iX>, 



I^J < 2\X°\A ^ 



h—i+t 



(57) 
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We can assume that, after constructing T™ p we construct Ty P to level Y# 2 _ t , and then inspect 
the edges from S° t to Y^ 2 _ t \X. These edges are unconditioned at this point and their number 
A is stochastically dominated by Bin(\S° t \ |Y^ 2 _ t |,p) . Given £&(i) of part (b) of this lemma, 

\X?\ < 18A 2i p + log 2 n. (58) 

Let % = aA, where 2i] < a < 1 — IO77. 

Case 2rj < a < (1 + e)/2 for some small e > constant. 

Using (57), (58) and \Y i2 _ t \ < Ac/ 2 "* gives 

EA° < (18A 2 > + log 2 fi)2Ao <1 ~ i+ *A /a " t p + n 2 (Pr(X?) + Pr(£ 6 (z))) 

< 3QC e 0+h p 2 {n P Y 0+i + 2C £ °(log 2 n) p {npY ' 1 + 0(n~ 9 ) 

< 36C e 0+h {np) 2 n-? +v+e + 2C e ° log 2 n{np)n~ v + 0{n~ 9 ) 
= 0(n^ 2 ). 

Case (l + e)/2 < a < 1 - IO77. 

For i > (1 + e)/2A, < 20A 2 >. Thus 

EA° < 20A 2 > • 2A <1 - i+ *Ao /a ~*p + n 2 (Pr(P) + Pr(£ 6 (z))) 
< 40C^ +£l p 2 (np) fo+l + 0(n~ 9 ) 

= o{rr^ 2 ). 

In either case, with probability 1 — o(n~ 10 ), A < logn. 

(d) After growing T£ p to level we grow Ty P to level £2 — t. Then has a binomial 
distribution and EA(t) < \XiA\Yi2_t\p. The result follows from the Chernoff inequality. □ 

4.4 Small average degree: 1 + o(l) < d < 2 

This section contains further lemmas needed for the case 1 + o(l) < d <2. 
We will assume now that 

l + o(l) < d < 2. 

Let a vertex be small if it has in-degree or out-degree at most np/20 and large otherwise. Let 
weak distance refer to distance in the underlying undirected graph of D n p . 

Lemma 14. 

(a) Whp there are fewer than n 1 ^ small vertices. 
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(b) If np > 2 logn then whp there are no small vertices. 

(c) Whp every pair of small vertices are at weak distance at least 

= logn 
10 log logn 

apart. 

(d) Whp there does not exist a vertex v with max {deg + (v), deg~(v)\ < logn/20. 

(e) Let s*(v) be given by (1). Whp for all vertices y, 

uei^(y) deg ^ 

Proof 

(a) The expected number of small vertices is at most 



logn/20 . _ . 

E r , )p k Q M = 0(n ^). (59) 



n 

k=0 



k 

Part (a) now follows from the Markov inequality. 

(b) For np > 2 logn the RHS of (59) is o(l). 

(c) The expected number of pairs of small vertices at distance £ w or less is at most 



tlO I logn/20 

n 2 Y2 k n k p k+1 2 V ( '\ 1 )p l q n - 



k=0 \ 1=0 v ' 

O(ne l0 (2d\ognY 10+1 (20edy° sn/w n- 2d ) = 0(n ■ n l ' w+o{ ^ ■ n 1 ' 2 ■ n~ 2 ) = o(l). 
(d) The expected number of vertices with small out- and in-degree is (9(t7, 1 - 2x - 8002 ) = o(l). 



(e) For 1 < k < A let 

Aft 



1 1 < < ( log l ogn )4 

(log logn) 4 (i^^<^<A - 



Let e = log ^ n ■ The probability that there exists a vertex of in-degree k G [1, Ao] with 
in-neighbours of in or out-degree outside (1 ± e)np, is bounded by 
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Now assume that there are fewer than neighbours of v of in or out-degree outside (1 ±e)np. 
Assuming at most one neighbour w of y is small, 



deg Q) 
deg + (w) 



(l + 0(e))k 

(l + 0(e))(k-X k ) + 0(X k ) 




log n 



E 



(log log 7l) 4 

< k < A, 



u<=N-(y)\{w} 



(log log n) 4 







This completes the proof of the lemma. 



□ 



Let weak distance refer to distance in the underlying graph of -D n p , and let a cycle in the 
underlying graph be called a weak cycle. 

Lemma 15. Whp there does not exist a small vertex that is within weak distance £\o of a 
weak cycle C of length at most t\§. 



5 Analysis of the random walk: Estimating the station- 
ary distribution 

In this section we keep Assumption 1 and assume that we are dealing with a digraph which 
has all of the high probability properties of the previous section. 

5.1 Lower Bound on the stationary distribution 

We use the properties described in Section 4.2. We derive a lower bound on P^ l+l {y). For 
this lower bound we only consider (x, y)-paths of length 2£ + 1 consisting of a T l ^ w path from 
x to Xi followed by an edge from to Yi and then a T l ° w path to y. The probability of 
following such a path is Z(x, y), see (29). Lemma 42 implies that 



Proof Let v : C be such a pair. Let |C| = i and j be the weak distance of v from C. The 
probability that such a pair exists is at most 




O ( n l/10+o(l) . n V10+o(l) . n -4/5+o(l)) = o(l) 



□ 



P^ +1 \y)> (i-o(i)) 



deg (y) 



for all v G V. 



(60) 
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Lemma 16. For all y £ V , 

> (1 -O(l)) . 

y m 
Proof It follows from (60) that for any y £ V, 

/it / / £ 

□ 



5.2 Upper Bound on the stationary distribution 

Lemma 16 above proves that the expression in Theorem 2 is a lower bound on the stationary 
distribution. As^Wy — 1, this can be used to derive an upper bound of 7r y < (l + o(l)) deg ^ 
which holds for all but o(n) vertices y. In this section we extend this upper bound to all 
yeV. 

We use the properties described in Section 4.3. We now consider the probability of various 
types of walks of length £q + 1 from x to y. Some of these walks are simple directed paths in 
BFS trees constructed in a similar way to the lower bound, and some use back edges of these 
BFS trees, or contain cycles etc. We will upper bound P% 0+1 (y) as a sum 

P l x 0+ \y) < Z e x 0+1 (y) + Sf +1 (y) + Q[ 0+ \y) + R x a+1 (y), (62) 
where the definitions of the probabilities on the right hand side are described below. 



Z* 0+1 (y). This is the probability that W x (lo + 1) = y and the (£i + l)th edge (u,v) is such 
that u £ X and v £ Yi 2 \ X, and the last £ 2 steps of the walk use edges of the tree Ty P . 
These are the simplest walks to describe. They go through T" p for t\ steps and then 
level by level through Ty P . They make up almost all of the walk probability. 

S^ +1 (y). This is the probability that W x (^o + 1) = V goes from x to y without leaving X. 
This includes any special cases such as, for example, a walk xyxy...xy based on the 
existence of a cycle (x, y), (y, x) in the digraph. 

Q^ +1 (y)- This is the probability that W x (£o + 1) = y and the {l\ + l)th edge (u, v) is such 
that v £ Ye 2 D X and the last £ 2 steps of the walk use edges of the tree Ty P . We exclude 
walks within X that are counted in S^ +1 (y). 

R^ +1 (y). This is the probability that W x {£q + 1) = y and during the last £ 2 steps, the walk 
uses some edge which is a back or cross edge with respect to the tree T" p . 
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Upper bound for Z^. 0+1 (y). 
It follows from (55) that 

Upper bound for S i x 0+1 {y). 



zi a+1 {y)< (i + (l))^p. (63) 



Let W x be a walk of length t in X, and let W x (t) = v. Let rf~ ax = max^ e x \N~(w) D X\. 
Tracing back from v for t steps, the number of walks length t in X terminating at v is at most 
(^max)*> so this serves as an upper bound on the number of walks from x to v of this length. 
By Lemma 13(a), we may assume that <i~ ax < IOO/77. 

Applying this description, there can be at most (100/?7) £o+1 walks of length £0 + 1 from x to 
y, which do not exit from X. We conclude that 



\ CQnp J \n 1+r i/ 2 



(64) 



Upper bound for Q x 0+1 (y). 

We say that a walk W x delays for t steps, if W x exits X for the first time at step £\ + t. A 
walk delays at level i, if the walk takes a cross edge (to the same level i) or a back edge (to a 
level j < i) i.e. a non-tree edge e = (u, v) contained in X that is not part of T" p . 



Lemma 17. Let t 







KK 
log np 



where K = 2 log(100C /?7Co), then 



Pr(yV x (£o + 1) = y and W x delays for to or more steps) = o(l/n) 



Proof The only way for a walk to exit from X is via Xi t (recall that edges oriented out 
from Xi end in Xi+i). Let W x be an (x, y)-wa\k which delays for t steps, and then takes edge 
e = (u,v) between X^ and Yi 2 -t \ X. There are at most (100/?7) £l+ * walks of length £1 + t 
from x to u within X. After reaching vertex v, W x follows the unique path from v to y in Ty. 
Applying Lemma 13(d) we see that the total probability P^(t) of such walks of length 

^o + l and delay t is 

(100/77)^+* (9\X h \\Ye 2 - t \p + log 2 n) 

(c npY a+1 
(100/ 'r]Y 1+t (9Co°(np) £ °-V + log 2 n) 
(c np)^ +1 

Q (l_ / (IOO/t^ YY 1 | log 2 n\y 



Pt(t) < 

< 
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So, 

where A = (100C /r/c ) 1+r '. 

Now A A = Also A A /np = o(l) if log 2 np > 2(logA)(log n) in which case the RHS of 

(65) is 0(1/72), which is what we need to show. So assume now that log 2 np < 2(log A)(logn). 
This means that A — > 00 and then 

A A < (A_ \ A _ 
(np) to ~ \e K J 

Thus in both cases 

P\> t ) = o(l/n). (66) 

□ 

We can now focus on walks with delay t, where 1 < t < to- A non-tree edge of X is an edge 
induced by X which is not an edge of T" p . For 4z < (1 — e)A, Lemma 10 implies that whp 
the set U = X<i contains at most \S\ edges. For, if U contained more than \U\ + 1 edges then 
it would contain two distinct cycles Ci, Ci- In which case, C%, C2 and the shortest undirected 
path in U joining them would form a set S which satisfies the conditions of Lemma 10. Thus 
there is at most one non-tree edge e = (u,v) contained in A<( 1 _ £ )a/ 4 . 

Let 9 = 2t]A. We classify walks into two types. 



Type 1 Walks. These have a delay caused by using a non-tree edge of X<e, but no delay 
arising at any level i > 9. Thus, once the walk finally exits Xg to Xg + i it moves forward 
at each step towards X^, and then exits to Y^ 2 _ t \ X. 

Type 2 Walks. These have a delay arising at some level Xi, i > 9. We do not exclude 
previous delays occurring in X<g, or subsequent delays at any level. 

Type 1 Walks. We can assume that X< e induces exactly one non-tree edge e = (u,v). Let 
u G Xi then v G Xj, j < i. There are two cases. 

(a) e is a cross edge, or back edge not inducing a directed cycle. 

Here the delay is t = i + 1 — j and this is less than to by assumption. Then, as we will see, 

Pr (Type 1(a) walk) < 1 * £ Z£o-*+*+%) = O ( . (67) 

{conpf deg + H w ^t {v) \n{np) 2 J 

The term l/(conp) 2 arises from the walk having to take the out-neighbour of x that leads 
to u in T™ p and then having to take the edge (u, v). The next step of the walk is to choose 
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w G N + (v) and it must then follow a path to y level by level through the two trees. The value 
of Z^~ l+ i +1 {y) can be obtained as follows. Let £' = £ — (i — j) — 1, then as t < t = o(A) we 
have that £' ~ £q. For w G N + (v) replace £q by £' in (55) above, to obtain Z(w, y) = 0(l/n), 
see Remark 5. This verifies (67). 

(b) e is a back edge inducing a directed cycle. 

Let xPu be the path from x to u in T^ p (8). As v is a vertex of xPu, we can write xPu = 
xPv,vPu and cycle C = vCv = vPu, (u,v). Let a > 2 be the length of C. For some w in 
vPu the walk is of the form xPv, vPw, (wCw) k , wPz, where wCw is C started at w, the walk 
goes round wCw, k times and exits at w to u' G N + (w) \ C and then moves forward along 
wPz to z G and then onto y. The delay is t = ka and this is less than t by assumption 

We claim that 

(68) 



w£C k>l ° v y «'eA r +(iu)\C 



The term (conp) kcr accounts for having to go round C k times and we can argue that 
Z^r ka+1 (y) = 0(l/n) as we did for Type 1(a) walks. 

So from (67) and (68) we have that 

Pr(Type 1 walk) = O ( ) . (69) 

\n[npY J 

Type 2 Walks. Suppose W x is a walk which exits X at step £±+t and is delayed at some level 
i > 9 by using an edge (u, v). The walk arrives at vertex u £ Xi for the first time at some 
step i + t' and traverses a cross or back edge to v G X,-, j < i. 

A contributing walk will have to use one of the A°(x,y,t) < logn edges described in Lemma 
13(c). By Lemma 13(a) there are at most (100/r]Y 1+t logn from x to u G X°. Once the walk 
reaches to G Yg 2 -t there is (by assumption) a unique path in Ty P from w to Let P(i,t) be 
the probability of these Type 2 walks, then 

(100 A/)^ logn 

l ' JS (c np)^+i U [ n i + v/2j- ^ 

Thus finally from (66), (69), (70) 

„ „ /l 1 \ 

(71) 



Qi a+ \y) = P\> to) + Pr(Type 1 walk) + ^ ^ P(i,t) = O Q Aj) ■ 

i<t<t e<i<4 

Upper bound for i?^ 0+1 (y). 
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Let Y = Y<i 2 be the vertex set of Ty P (£ 2 ). We assume that Y induces a unique edge e = (u, v) 
which is not in Ty P . Note that the condition that \Y\ induces at most \Y\ edges holds, even 
if we replace £ 2 with 2£ 2 based on the construction of Ty (2£ 2 ) to depth 2£ 2 , by branching 
backwards from y. We consider two cases. 

(i) e is a cross or forward edge, or back edge not inducing a directed cycle. 

We have u £ Yi,v £ Yj for some i < j < £ 2 . We suppose the (x,y)-walk is of the form 
xWu, (u, v),vWy where u £" vWy, so that vWy is a unique path in Ty P . 

Case 1: i > (4rj/5)A. 

Let £3 = (1 — r//10)A. The length of the path (u, v),vWy is j, so the length of the walk xWu 
is £0 — j + 1. Let h be the distance from u to X<^ 3 in T^ p . Then 

/i = max {0, £ -£ 3 -j + 1} < max {0, £ - £ 3 - % + 1} . 



Let to £ X<^ 3 . By Lemma 13, the number of (x, u>)-walks of length £ < £ 3 in X<£ 3 passing 
through w at step £ is bounded by (100/rjY 3 . The the number of walks length h from u to 
X<^ 3 is at most A h . Thus, the number of (x,y)-walks passing through e = is bounded 

by (IOO/t^AcA Thus 

^-H ^yi "'"' 1 "" 1 ' <72) 



Case 2: < i < (4t//5)A. 

Let i = aA. Let 77' = 77(1 - a) , f x = (1 - 10r/)A, £' 2 = II7/A, and let £(, = f x + £' 2 . As observed 
above, the vertex set U of the tree Ty of height £' 2 above u induces no extra edges, so we can 
apply the upper bound result for walks of length £' + 1 from x to u based on the assumption 
R e } +1 (u) = 0. Thus 

pyV)<(i + 0( i))^M. 

m 

The probability the walk then follows the path (u,v),vPy is 0(1/ (np) 2 ). Thus 

«=»g)' (73) 



(ii) e is a back edge inducing a directed cycle. 

In this case, there is an edge e = (u, v) where u £ Yi, v £ Yj and j > i. Let t;P« denote the 
path from v to it in Ty P , and C the cycle vPu, (u, v). There is some k > 1 such that the walk 
is P = xPu, (uCu) k , uPy. Let a be the length of C, let r be the distance from u to y in T" p , 
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and let s = r + ka. Let £ = £q — s. Then £ + 1 is the length of the walk xPu from x to u 
prior to the final s steps. 

Either £ < (1 + 4r//5)A and the argument in Case 1 (i > 4r//5)A) above can be applied, giving 
us the bound 

^■"( ^ l-^ (74) 

Or£>(l + 4?7/5)A and we adapt Case 2. Let w be the predecessor of u on P . We can use 
Remark 5 as above to obtain P^.{w) < (1 + o(l))deg~ (w) /m. As ka > 2, r > 0, (the worst 
case is u = y, w £ N~(y)), we obtain 



Thus, using (72), (73), (74), (75) we have 



(75) 



^ )=0 (bi^)- (76) 

We have therefore shown that S^, 0+1 (y) + Q e ^ +1 (y) + i?f. 0+1 (|/) = o(l/n) completing the proof 
that 

Pi° + %)<(l + o(l))^M (77) 

m 

Lemma 18. For all y £ V } 

7T y = (1+0(1)) - ■ 

Proof It follows from (60) that for any y EV, 

5>PJ<° + «M < + = (1 + 0(1))^. (78) 



7Ty 



m 14 — ' m 

x&V x£V 



The lemma now follows from Lemma 16. □ 
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6 Stationary distribution: Removing Assumption 1 



6.1 Large average degree case 
6.1.1 np > n 5 . 

We can deal with this case by using a concentration inequality (79) from Kim and Vu [13]: 
Let T = (W, E) be a hypergraph where e £ E implies that |e| < s. Let 

Z = S Tw e \\ Zj 

where the w e ,e £ E are positive reals and the z^i £ W are independent random variables 
taking values in [0, 1]. For A C W, \A\ < s let 

Z A = ] [ Zi. 

e£E i£e\A 
e~DA 

Let Ma = E(Z^) and Mj(Z) = maxA,\A\>j for j > 0. There exist positive constants a and 
b such that for any A > 0, 

PrflZ - E(Z)| > aAV^oMO < 6|W| S -V A . (79) 

For us, W will be the set of edges of K n the complete digraph on n vertices. Zi will be the 
indicator variable for the presence of the ith edge of K n . E will be the set of sets of edges in 
walks of length s — \2/5] between two fixed vertices x and y in K n , and w e — 1. Z will be 
the number of walks of length s that are in D n>p . In which case we have 

e(z) = (i + o(i))r2 s -y 

Mj < (1 + o(l)K- J '-y- j < (1 + o(l))E(Z)/np for j > I. 
So M = E(Z) and applying (79) with A = (logn) 2 we see that for any x,y we have 
Pr(|Z-E(Z)| = 0(E(Z)n~ s I 2 \og° {1) n)) = l-0(n~ 3 ). 

Thus whp 

p^)Hi+o(i)) (( /rX)- 4 

We now finish with the arguments of Lemmas 16 and 18. □ 
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6.2 Small average degree case 



6.2.1 Lower bound on stationary distribution 

A vertex is small if it has in-degree or out-degree at most np/20 and large otherwise. In the 
proofs of Section 4.2 we assumed x,y were large. We proceed as in Section 5.1 but initially 
restrict our analysis to large x, y. Also, with the exception of Yi we do not include small 
vertices when creating the Aj,Yj. Avoiding the < n 1//5 small vertices (see Lemma 14(a)) is 
easily incorporated because in the proof we have allowed for the avoidance of ?t, 2 / 3 + ( 1 ) vertices 
from Ui^« e ^ c - Provided there are no small vertices in N~(y), our previous lower bound 
analysis holds. In this way, we show for all large x, y that, 

Pi 2 " +1 \y)> (i-o(i))^M. (so) 

m 

If x is small, then it will only have large out-neighbours (see Lemma 14(c)) and so if y is large 
then 

P^\y) = -4— £ Pf+%)>(1- (1))^^. (81) 
A similar argument deals with small y and x arbitrary i.e. 

J^) deg (*) m deg+ w ™ 

(82) 

We have used Lemma 14(e) to justify the last inequality. 

In the case that some u G N~(y) has small out-degree, then by Lemma 14(c) there is at most 
one such u whp. For z G N~(y), we repeat the argument above for each factor P£ e+1 (z). The 
extra term s*(y) now arises from deg _ (tt)/deg + (w) and 

>(2m). 



^ +2) w= E W^sa-oW) 1 E ^4>(i-o ( i))^£M±^. 

z — , deg (z) m ^— ' deg m m 

We can now proceed as in (61). 



6.2.2 Upper bound on stationary distribution 

We first explain how the upper bound proof in Section 5.2 alters if Assumption 1 is removed. 
The assumption that the minimum degree was at least c np was used in the following places: 

1. We assumed in Section 4.2 that deg + (x), deg~(y) > c np. These assumptions can be 
circumvented by using Lemma 14(c) with the methods used in the lower bound case. 
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2. In (64), (70), (72). In these cases we used (c npY as a lower bound on the product of 
out-degrees on a path of length A for some A > £%. Using Lemmas 14 and 15, we see that 
small vertices are at weak distance at least £io and therefore there can be at most 11 
such vertices on any walk length £ + 1. Thus, after dropping Assumption 1, we replace 
this lower bound by (conp) A-n , and the proof continues essentially unchanged. 

3. In the proof of Lemma 9 we made a re-scaling B = 1000 / (conp) 2e+l . The exponent 
2£ + 1 was replaced by £ + 1 in the proof of (55) in Lemma 12. We now replace £ + 1 
by £ - 10. 

4. In the proof of Lemma 7 we made a re-scaling U(i) = W(y,i) ■ (coup) 1 at each level 
3 < % < £■ Assume that 2£ 2 < £\o i.e. rj < 1/250 so that there is at most one small 
vertex u in Y . If we replace {c^np) 1 by (cQnp) % ~ 1 does not affect our concentration 
results, provided % > 3. The bounds on U v are now (c /np)(c /C o y < U v < 1, and 
6i — 1/ a/ (A logn)*~ 2 . If the small vertex u £ N~ (y) then the direct calculations used in 
the lower bound hold. If the small vertex u is in levels % = 2, 3 this adds an extra term 
of 0(deg~(u)/(m(np)* -1 )) to our estimate of Z[p +1 (y) in Section 5.2. 

5. In (68), (73), (75). It follows from Lemma 15, that if e.g. Ty P contains a non-tree edge, 
then no vertex of T" p is small, and the calculations in the proof are unaltered. 



Thus the proof as is works perfectly well if we assume that y is large and if it has no small 
in-neighbours and there is no small vertex in Y. We call such a vertex y ordinary. 

If y is small then from Lemmas 14 and 15 we can assume that all of its in-neighbours are 
ordinary. This is under the assumption that 2£ 2 < £\o e.g. if 77 < 1/250. So in this case we 
can use Lemma 14(e) and obtain 

P (W( V) - V Pi '° +1) ^ < 1 + °( 1 ) v deg ' (e) (i 1 o(i)) deg ~ {y) 

x [V) ^ deg + (0 ~ m ^ deg + (0 [i + 0[i,) m ' 



Suppose now that y is large and that there is a small vertex u G Y . We can assume from 
Lemma 15 that Y does not contain any edge not in Ty P . Either u £ N~(y) or, if not, from 
point 4. of the discussion above, an extra O (deg~ (u) j (m(np))) is added to Z^ +1 (y) for the 
probability of the (x, |/)-walk going via u. 

In the case where u £ N~(y) then as in the lower bound 

l + o(l) / deg~(w) deg~(w 



- (1 + 0(1)) (deg-(y) + <*(y)). 



m 

We have now completed the proof of the asymptotic steady state without Assumption 1. 
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7 Mixing time and the conditions of Lemma 3 

7.1 Upper Bound on Mixing time 

Let T be a mixing time as denned in (4) and let I = 0(log np n) be given by (25). We prove 
that (whp) T satisfies 

T = o(i\ogn) = o((logn) 2 ). (83) 

Define 

d(t)= max :\PP-PV\ (84) 

x,x'£V 

to be the maximum over x, x' of the variation distance between Px^ and P^) . It is proved in 
Lemma 20 of Chapter 2 of Aldous and Fill [1] that 

d(s + t)< d(s)d(t) and max \P® - n x \ < d{t). (85) 

X 

Equation (42) implies that whp 

d(2£ + l) = o(l), (86) 
and so (83) follows immediately from (85) and (86). 

7.2 Conditions of Lemma 3 

We see immediately from (83) that Condition (b) of Lemma 3 is satisfied. 
We show below that whp for all v G V 

R T (1) = l + o(l). (87) 

Using (87), the proof that Condition (a) of Lemma 3 is satisfied, is as follows. Let A = 1/KT 
as in (6). For \z\ < 1 + A, we have 

T T 

R T (z) > 1 - > 1 - (! + J>* = 1 - °W- 

i=l t=l 

Thus for v £ V, the value of p v in (7) is given by 

^ = (1 + (1))^-M. (88) 
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Proof of (87): If d > (logn) 2 , then the minimum out-degree of D np is Q(dlogn). In which 
case we have for any x, y 

Pr(W„(t) = y\ W„(t-1) =x) = ( — j ■ (89) 

\ a log n / 

The expected number of returns to v G V by W v during T steps, is therefore 0(T '/ d logn)= o(l). 
Now assume that d < (logn) 2 . 

(i) Lemma 10 implies that if H is the subgraph of D HjP induced by vertices at weak distance 
at most A/20 from v then H contains at most |V(i2")| edges. 

(ii) Lemma 14 implies that there is at most one small vertex in H. 

(iii) Lemma 15 implies that there is no small vertex within weak distance 10 of a weak cycle 
of length < 10. 

Assume that conditions (i), (ii), (iii) hold. Let A 4 denote the set of vertices u ^ v such that 
D n p has a path of length at most 4 from u to v. We show next that: 

With probability 1 - 0(l/(np) 2 ), W v (i) <£ A 4 , 1 < i < 4. (90) 

For this to happen, there has to be a cycle C of length at most 8 containing v. If such a cycle 
exists then all vertices within weak distance 10 of v have degree at least np/20. Furthermore, 
the only way that the walk can reach A4 in 4 or less steps is via this cycle. This verifies (90). 
Assume then that W v (i) £ At, 1 < % < 4. 

Suppose next that there is a time T\ < T such that W v (Ti) = v. Let 

T 2 = min {r < T\ : W v (t) G A4, r < t < Ti}. It must be the case that d(T 2 ) = 4 where d(t) is 
the distance from W v {t) to v. 

If A 4 does not contain a small weak cycle then the walk must proceed directly to v in 4 steps. 
The probability of this is 0(l/(np) 3 ), since at most one vertex on the path of length 4 from 
x = yViT^) to v will be of degree at most np/20. 

If there is a small weak cycle C then there is an edge e of C whose removal leaves an in- 
branching of depth 4 into v. There are now 2 paths that W can follow from x to v. One uses 
e and one does not. Each path has a probability of 0(1/ (np) 4 ) of being followed. Putting this 
altogether we see that the expected number of returns to v is 0(1/ (np) 2 + T/(np) 3 ) = o(l). 
This completes the proof of (87). 
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8 The Cover Time of D UjP 



8.1 Upper Bound on the Cover Time 



in) -> 



For np = dlogn, d constant, let to = (1 + e ) (^l°g (^zj)) nlogn. For np = dlogn d = d(i / 
oo let to — (1 + e)nlogn. In both cases we assume e — > sufficiently slowly to ensure that all 
inequalities below are valid. 

Let Tjj(u) be the time taken by the random walk W u to visit every vertex of D. Let U t be the 
number of vertices of D which have not been visited by W u at step t. We note the following: 

C u = E(T D (u)) = ^Pr(T D (u)>t), (91) 

Pr(T D (u) > t) = Vv{T D {u) > t - 1) = Pr(C/ t _! > 0) < min{l,E(tf t _i)}. (92) 

Recall that A v (t) denotes the event that W u (t) did not visit vertex v in the interval [T, t\. It 
follows from (91), (92) that for any t > T, 

C u <t + 1 + J2 E(^) < t + 1 + Pr (M*))- (93) 



S>t V s>t 



Assume first that d(n) — > oo. If s/T — > oo then (8) of Lemma 3 together with the value of p v 
given by (88), and concentration of in-degrees implies that 



(1 -° (1) H + Q(e-^). 
n 



Pr(A v (s)) < (1 + o(l)) exp <{ }> + 0( e - S2 ^ >). 

Plugging (94) into (93) we get 

C u < t + l + 2n^(exp{-2^^)+O(e-^/ T ))) (95) 

s>t ^ ^ ' ' 

< t + 1 + 3n 2 exp " | + 0{nTe~ n ^) 

= (l + o(l))t . 

We now assume that d is bounded as n — > oo, and the conditions of Lemma 4 hold. For v G V 
we have 

Pr(A,(a)) = (1 + o(l)) exp {-(1 + o(l/ logn))^} + 0(e~ n ^) 
where, by Lemma 16, 

,,> ( i-o(i))*£M. 
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In place of (95) we use the bounds on the number of vertices of degree k given in Lemma 4, 
in terms of the sets Ki, i = 0, 1, 2, 3. Thus 

3 

C u <t + l + o(l) + Y,Si (96) 



i=0 



where 



S, = EDWE exp{-(l^} 



k£Ki s>to 



^ k 



keKi 

(l+e/2)ft 



< 



The main term occurs at i = 3. Using (12), (15), the fact that (nep(d—l))/(kd)) k is maximized 
at k = np(d — l)/d, and m = dn\ogn(l + o(l)) whp, we see that 



8m. (nep\ k fd—\\ 



(l+e/2)fc 



fc=conp 

< 8m, A e~ eConp/M 

= o(t ). (97) 

Note that K Q = 0. We next consider the cases % = 1,2. For i = 1, we refer first to Lemma 
4(i-a). If d - 1 > (logn)" 1 / 3 then #1 = 0. If d - 1 < (logn)" 1 / 3 , then D(k) < (loglogn) 2 , 
from (13). In this case t = 0((l/(d— l)))dn\ogn. Thus 



(log log n) 2 f d — 1 



(l+e/2)fe 

k \ d I 

keKi 
15 

- m S fe v d 

k=l 



= O(t )(loglogn) 2 (d-l)" e / 2 

= o(t ) (98) 
For i = 2, by Lemma 4 if d — 1 < (logn)" 1 / 3 and > 16, and using (14) we have D(k) < 
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(logn) 4 . Thus 

D(k) f d - l\ [1+e/2)k 

k£K 2 



^ W 4 « fa - i\( 1+e / 2 ) fc 



fcGi<r 2 

= O(t )log 4 n(logn)- (19/3+e/8) 

= o(t ). (99) 

If d— 1 > (logn) -1 / 3 then by Lemma 4(i-a) min{fc G A2} > (logn) 1 / 2 , and |A^| = O(loglogn). 
Thus, as d is bounded 



s 2 = O(to) ^ ^(i-D^) 



(l+e/2)fc 



fc>(logn) 1/2 

= o(t ) (100) 
The upper bound on cover time of C u < to + o(to) now follows from (96)-(100). 



8.2 Lower Bound on the Cover Time 

For np = dlogn, let ti — (1 — e)<ilog (xM nlogn. Here e — > sufficiently slowly so that all 
inequalities claimed below are valid. 

Case 1: np < n s where < 5 <C n is a positive constant. 

Let k* = (d— 1) log n, and let V* — {v : deg~(i>) = k* and deg + (v) = dlogn}. Whp the size 
\V*\ > n* = 47rlogn ^_ 1 ))i/2 ( see Lemma 4(h)). Let us first work assuming d > 1.05. In this 
case 7d = (d — 1) ln(d/ (d — 1)) > .15 and we write n* = n 7d- ° ( - 1 ). The maximum degree in D 
is at most A = 0(np) and so V* contains a sub-set V* of size n 7d//2 such that v, w G V* and 
x G V implies 

dist(x,v) + dist(x,w) > A/100. (101) 
dist(y,x) + dist(x,y) > A/50, fory = v,w. (102) 

Here " dist" refers to directed distance in D n p and recall that A = log n. 

Each v G V* has ir v ~ and so we can choose a subset V** of size > n" ld ^ 3 such that if 
vi,v 2 G V** then 

1 < s 

F«i - n V2 \ < — : — — • (103) 
n log n 
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Indeed, suppose that tt v G ^4-^ for v G V*. Divide this interval into log iU n equal 

sized sub-intervals and then use the pigeon-hole principle. 

Now choose u £ V** and let denote the set of vertices in V** that have not been visited 
by W u by time t\. Then E(|^|) — > oo, as the following calculation shows; 

E(|0|) > n W3 ^xp |- (1 + | _ ( e -nfa/T)^ _ T> 

where the last term accounts for possible visits before time T. 

Now assume that 1 + o(l) < d < 1.05. In these circumstances we have n* = log^n where 
u; — > oo, see (19). Equations (101), (102) now hold for all v,w G V*. This follows from 
Lemma 14 because the vertices of V* are small. The size of V** is at least n*/(logn) 10 and 
we can again write 

B(|vt|) > fexp(- (1+0(1))<: ' t ' Uo(e-""-^; 



(log n) 10 \ [ m 
—> oo. 

As in previous papers, see for example [5] , we will finish our proof by using, the Chebyshev 
inequality to show that ^ whp, thus completing the proof of Theorem 1. This will 
follow if we can prove that 

VarQV^I) = o(E(|V t | 2 ) + 0(\V**\ 2 n~ 2 ) = o(E(| V'\f). 

To establish this inequality, we will show that if v,w G V** then 

Pr(A,„(t 1 ) n A w (ti)) < (1 + o(l))Pr(A„(t 1 ))Pr(A UJ (t 1 )). (104) 

To prove this, we identify vertices v, w into a "supernode" a to obtain a digraph D a with n — 1 
vertices. In this digraph cr has in-degree deg~(v) + deg~(w) = 2k* and out-degree 2<ilogn. 

The stationary distribution of _D CT . 

Let it* denote the vector of steady states in D a . The arguments we used in Sections 4 and 5 
remain valid in D a , and thus 

K * 1_ 1 

m 

However, we need to be more precise. For a vertex x of _D CT let 




We will prove for all x G V(D a ), that 

\n(logn) 8 ' 
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Proof of (105). 

Let £ = 7T — it* be the difference between tt and 7r*. Let P* be the transition matrix of the 
walk on D a , then 

{P(x,y) x,y^a 
(P{v,y) + P(w,y))/2 x = a . 
P(x, v ) + P(x, w) y = a 

Let £' be the transpose of £. It follows from the steady state equations that 

x i N + ({v,w}) 







-< 


(en, = | 




"< + 









We rewrite this as 

£'(I-P*)=t/ (106) 
where rj x = for x N + ({v,w}) and |%| < \ir v — ir w \/2 otherwise. 

Multiplying (106) on the right by M = YlJ^o (P*Y we have 

- P*)M = - (P*) T ) = rfM. (107) 

Let 

(P*) T = U + E (108) 

where II is the (n — 1) x (n — 1) matrix with each row equal to (tt*)'. The definition of T 
implies that each entry of E has absolute value bounded by n -3 . 

Now write £ = an* + ( where C -L n* . It follows from (n*)'P* = (tt*)' and (107) that 
(an* + C)'(/ - (P*) T ) = CV - ( P *V) = CV ~ n - E ) = V'M. 

Now 

C(i — e) = C(i - (P*f + n) = t]'m + c'n. 

As ( _L 7r* this implies that 

C(I - E)C = V'MC (109) 

Note that 



T-l 

|i/MC|<X)l'/( p *) t CI<rMICI, (no) 



t=0 

where \z\ denotes the li norm of z. 
Now 

'n-\ 



|C'(/ - E)C\ > |C| 2 - imi > ICI 2 - n" 3 £ |C t | > |C| 2 (1 - n- 2 ). (Ill) 



.1=1 
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It follows from (109), (110) and (111) that 

|C| 2 (l-n- 2 )<TM|C| 

and so using (103) we find that 



Kl =0( -77-^-^ ) • 



n(logn) 8 

Now let 1 denote the (n — 1)- vector of l's. Then 

= 1-1 = (7T- 7T*)'l = f' 1 = a + C'l- 

Using (112) this gives 

H<|i||d = o 



n 1 / 2 (lojs; i 



Now £ x = an* + ( x for all x and so 



£ < 2a\n* x f + 2C 2 = O ( -r^-j, ■ - + -^-^ ) = O 



n(logn) 16 n 2 n 2 (logn) 16 / \n 2 (logn) 16 
This completes the proof of (105). □ 

Proof of (104). 

For v G V**, we first tighten (87) to 

i?„ = l + o(l/(logn) 2 ). (113) 

Assume first that np < log 10 n. Then (101) and (102) imply that for 1 < t < (logra) 2//3 , 
vertex v will be at distance > 21og 2 ^ 3 n — t from W v (t). Then once the walk is at a vertex 
w within distance log 2,/3 n of v its chance of getting closer is only 0(1/ log n). This being 
true with at most one exception for a vertex of low out-degree. The probability that there 
is a time t such that W v is within log 2 ^ 3 n of v and it makes 10 steps closer to v in the next 
100 steps is 0(T/ log 9 n) = 0(1/ log 7 n). This implies (113). If np > log 10 n then we use 
Rv < 1 + (1 + o(l))T/np. 

Similarly, 

R a = l + o(l/(logn) 2 ). (114) 

The mixing time T in what follows is the maximum of the mixing times for D and the 
maximum over v, w for D a . Using the suffix Pr^ to denote probabilities related to random 
walks in D a and using (105), it follows that 

Pr M )) < exp(- ' 1+0 ' r <»< f ' }-o (e -"«-^) 

m 



(1 +o(l/logn))(7T„ + 7T w )t l 

< exp < > — o{e 

m 



a(ti/Th 

1 + o(l))Pr(A u (t 1 ))Pr(A. u ,(t 1 )). (115) 
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But, using rapid mixing in D a , 

Pr^Arih)) = ^2P^ u (x)Pr a (W x (t)^ a,l <t<h-T) 

x^a 

= Y^i< + 0{n^))Pv a {W x {t) ^a,l<t<h-T) 

On the other hand, 

Pr(A,(*i) n A w (ti)) = p «( z ) Pr (>V*(i) + v, w, T < t < h) 

= Yl + 0(n- 3 ))Pr(W x (t) ± v, w, 1 < t < h - T) 

x^v,w 

But, 

Pr a {W x {t) ^ a, T < 1 < t x - T) = Pv{W x {t) ^ v, w, 1 < t < h - T) 

because random walks from x that do not meet v,w or a have the same measure in both 
digraphs. 

It follows that 

Pr(A„(^) n A w (tx)) - P r<r (Ar(*i)) 

= - < + 0(n- 3 ))Pr(W x (t) ^v,w,l<t<h-T) 

x^=v,w 

< 



o(—^—\ V PriW^^v^^KtKh-T) 
Vnlog n) ~^ 

< O ( — V) E ^TY Pr (^(0 ^v,w,l<t<t x -T) 

\n\oin) X w PZ{x) 

< O f-r^s- ) O(nlogn) V P w T ( a; )Pr(W a; (t) ^ v, u>, 1 < t < h - T) 
Vnlog n) ' 



since P^(x) = f2(l/nlogn) 



< o (-^-) Pr(A,(ti) n (H6) 

\log n/ 

Equations (115) and (116) together imply (104). □ 
Case 2: np > n s . 

In this range we take t\ — (1 — e)nlogri and let ^* be the set of vertices of degree \np\ . A 
simple second moment calculation shows that whp we have |V*| = ^((np) 1 / 2-0 ^). We then 
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choose e so that E(|W|) > (np) 1 / 4 . It is then only a matter of verifying (104). The details 
are as in the previous case. 

This completes the proof of Theorem 1. □ 

Acknowledgement: We thank several referees whose insight and hard work has helped to 
make this paper correct and (hopefully) more readable, 
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